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Abstract — We propose an Euclidean geometric representation 
for the classical detection theory. The proposed representation 
is so generic that can be employed to almost all communication 
problems. The hypotheses and observations are mapped into R N 
in such a way that a posteriori probability of an hypothesis 
given an observation decreases exponentially with the square of 
the Euclidean distance between the vectors corresponding to the 
hypothesis and the observation. 

I. Introduction 

One of the oldest and the most frequently employed tools in 
science is the Euclidean geometry. Visualizing a problem in an 
Euclidean geometry improves our understanding and usually 
contributes to the solution. Hence, Euclidean geometry plays 
a central role in many different disciplines such as astronomy, 
mathematics, and geography. The detection theory is not an 
exception. 

The detection problem consists of a finite set of hypotheses, 
a set of observations, and a probabilistic transition mechanism 
between the hypotheses and the observations. Then the goal 
of the detection can be summarized as to guess the occurred 
hypothesis from the given observation (TJ. 

In this paper we are interested in a particular case of the 
detection problem. That is detecting the transmitted symbol in 
the receiver of a communication system. In a communication 
system a symbol X from a finite alphabet A is transmitted 
through an imperfect channel. The channel stochastically maps 
this transmitted symbol to an output Y which is an element of 
an observation space 1Z. 1Z may consist of a finite number of 
symbols, real numbers, or functions depending on the problem. 
The goal of the receiver is to predict X given Y. 

What we mean by an Euclidean geometric representation 
of the detection problem is mapping the alphabet A and the 
observation space K to R N which is a well known finite 
dimensional Euclidean space. This mapping of the symbols 
and observations should posses some properties in order to 
be useful. First of all, this mapping should be in such a way 
that the Euclidean distance between two vectors representing 
a symbol and an observation should reflect the probabilistic 
transition mechanism of the detection problem. In a com- 
munication system we usually transmit a stream of symbols 
through a channel rather than transmitting just a single symbol. 
Therefore, this representation should easily be extended to the 



multiple usages of the channel. Finally the translation of the 
vectors should have a useful meaning. 

The Euclidean geometry is employed in the communica- 
tion theory for a particular problem which is communication 
through additive white Gaussian noise (AWGN) channels. The 
signal space representation posses many nice properties that it 
became the standard representation for explaining this problem 
UK El, J3|. However, the application of the signal space 
representation is restricted to the detection under AWGN. 

Our goal in this paper is to develop a general Euclidean 
geometric representation for the detection theory which fits 
to almost all of the communication problems, such as non- 
Gaussian noise channels or even discrete output channels as 
well as AWGN channels. Since we would like our represen- 
tation to be a generic one, we have to sacrifice from some of 
the properties that signal space representation has. However, 
the developed representation still shares certain important 
properties of the signal space representation. 

In this work our aim is to provide a geometric viewpoint 
for the detection problem. Hence, we are not concerned with 
the computation of the a posteriori probabilities or likelihoods 
and we assume that these values are known. 

While mapping 7Z to H N we make use of the log-likelihood 
ratio which is a very well known concept in communication 
theory. In other words, we show that log-likelihood ratio is 
not only a parameter for optimum decision but also provides 
a full geometric representation for the observations. 

Developing a geometric representation for detection theory 
is first attempted by Dabak in |4|. However, the geometry he 
proposed is not Euclidean, not even Riemannian. On the other 
hand, the geometry that we propose is Euclidean. 

This paper is organized as follows. In the next section the 
desired properties from a geometric representation are ex- 
plained. The proposed geometric representation is introduced 
in the third section. The fourth section provides some examples 
and then the paper is concluded. 

Throughout the paper we adopt the following notation. We 
denote deterministic variables by lowercase letters whereas 
random variables by uppercase letters except letter C which 
is employed in denoting the normalization constants. Boldface 
letters denote vectors. Finally sets are denoted by calligraphic 
letters except M. N . 



II. The properties expected from 
an Euclidean geometric representation 
of the communication problem 

We need two operators and a real function to formally 
describe an Euclidean geometric representation of a commu- 
nication channel. The first operator is required to map the 
symbols in the alphabet A to the vectors in Mr. This operator 
is denoted by Ma {•}• The second operator is used to map 
the observations in 7Z to M and denoted by Mr, {.}. Finally 
we need a function /(.) from M + U {0} to M + to describe the 
a posteriori probability of the transmitted symbol X given the 
observation Y in terms of the distance between Ma {X} and 
M-r{Y} as follows. 



Pr{X = x\Y = y} 



^f(\\M n {y}-M A {x}\\) (1) 



pN 



and the normalization 



where . denotes the usual norm in 
constant C is given by 

C= ]T f(\\Mn{Y}-M A {x}\\) 

The most important requirement to be imposed on the 
geometric representation is related to the function /(.). The 
function /(.) should be a monotonically decreasing function. 
In other words Pr{X|y} should decrease as the distance 
between Ma {X} and Mr {Y} increases. 

As it is mentioned in the introduction part, we usually trans- 
mit a sequence of symbols through a communication channel 
and the geometric representation should be in consistent with 
the multiple usage of the channel. Suppose that a vector X = 
[X\, X 2 , . . • , Xm] is to be transmitted through the channel by 
using it M times. We assume that channel is memoryless. Let 
the observations be denoted by Y = [Yi, Y2, ■ ■ ■ , Ym]. The 
combined alphabet in this problem is A M and the observation 
space is 1Z AI . We need two operators from these alphabet and 
observation space to M NM . In order to keep the consistency 
with the single usage of the channel, these operators can be 
defined simply by concatenating the Ma {Xi} and M-r, {Yi] 
for i = 1, 2, . . . , M as follows. 



M a m {X} 



M u m{Y} 4 



M A {X 1 } 

M a {X 2 } 

Ma{X m } 

M n {Y{\ 
M-r {Y 2 } 



(2) 



(3) 



For 



Mr {Y m } 

the sake of consistency we desire that the a 

posteriori probability of X given Y is related to 

\\M-rm {Y} — Ma** {X}|| with the same function /(.) as 

in the single usage case. In other words we would like to 
have 

Pr{X|Y} = ±-J(\\M n M {Y} - M a m {X}|| ) (4) 



where C\ is a normalization coefficient. Since, the norm 
appearing in the equation above is an Euclidean norm 



Pr{X|Y} 



1 



J2\\Mn{Y}-M A {X t }\\ 



(5) 



Since, we assumed the that channel is memoryless 



M 



Pr{X|Y} = []Pr{X 2 |r 4 } 

i=l 

M 



(6) 



where di = \\Mr {Yi} — Ma Combining this result 

with © imposes a further condition on /(.). That is /(.) 
should satisfy the following equation 



A I 



=«IIM) 



(7) 



for all positive di and an a in R. There is only a single form 
of function which satisfies this requirement. In order to satisfy 
this requirement /(.) is necessarily in the form of 



/(x) = ^exp( 7 a; 2 ) 



(8) 



where (3 is a positive real number. Recall that the function 
/(.) should be a monotonically decreasing function. Hence, 7 
should be a negative real number. 

In order to make the Euclidean geometric representation 
more useful we should assign a useful meaning to the transla- 
tion (vector addition in 1^) operation. In some specific com- 
munication channels, such as AWGN channel, the observation 
space 1Z has well defined addition and scaling operations. If 
this was true for all communication channels then we could 
assign a meaning to the translation operation by imposing a 
linearity constraint on the operator Mr{.}. However, there 
are many channels with an observation space which do not 
posses a well defined addition operation. Therefore, we have 
to adopt a more general approach for assigning a meaning to 
the translation operation. 

We assign a meaning to translation operation related to 
the multiple usage of the channel again. However, in this 
case we assume that M replicas of an equally likely dis- 
tributed symbol X is transmitted through the channel instead 
of an arbitrary vector X. Let the observations be denoted 
by Y = [Yi, Y 2 , . . . , Ym]- A straight forward approach for 
developing an Euclidean geometric representation for this 
problem would be using the operators defined in (0 and (0 
with X = [X, X, ...,X]. However, a better representation 
can be developed by making use of the fact that M replicas 
of the same symbol is transmitted in this case. We propose 
the following operator from TZ M to R N (rather than R NM ) to 
map Y resulting from transmitting M replicas of an equally 



likely distributed symbol through the channel. 

M 



M U M{Y}^J2Mn{Yi} 



(9) 



Then the translation operation may become useful if the a 
posteriori probability of X given Y can be expressed as 



Pr{X|Y} 



f(\\M n M{Y}-M A {X}\\) 



J2 M n{Y t }-M A {X} 



i=l 



(10) 



III. The Transformations from A and TZ to 



The necessary properties that an Euclidean geometric repre- 
sentation should posses are laid down in the previous section. 
In this section, we propose the operators Ma{-}> Mn{-} 
and then prove that these operators satisfy conditions imposed 
in the previous section. 

First of all the codomain of and Mn{-} should 

be more clearly specified. The input symbols and channel 
observations are mapped to by A^^i{.} and Mn{-} 
where N is the number of elements in the alphabet A. 

We need to enumerate the input symbols before defining 
Ma{-}- Let me elements of A be {xi, X2, ■ ■ ■ , xn}- Then 
Ma {■} is defined from A to R N as 



2N 



2N 2 



N 

E« 

i=i 



(11) 



where is the i th canonical basis of M. N . 
vector is 

i In - 
2n\I n 



The norm of this 



\\M A {xi}\\ 



(12) 



for any Xi in A. Hence, all of the input symbols are mapped 
to vectors at equal distance from the origin. Moreover, the 
distance between Ma {xi} and Ma { x j} for anv x i an d x j 
is 

\\M A {xi} - M A {xj}\\ = -^=— . 

Hence, the input symbols are mapped to equidistant vectors. 
When these two facts are combined it can be deduced that the 
range space of the alphabet A is composed of the vertices of 
a regular polyhedron whose center of mass is at the origin. 
For instance, for N — 3 the the input symbols are mapped to 
the vertices of an equilateral triangle with center of mass at 
the origin. 

Mapping an observation depends on the a posteriori prob- 
abilities of the input symbols given the observation occurred. 
Hence, we inherently assume that a posteriori probabilities are 
well known. With this assumption we propose the following 
definition for M-ji {y} for a y in TZ. 

Mn{y\ = 2^ lQ g ~m ~ ~ — 7 e »- (I 3 ) 



»=i 



Ilf=iPr{* 



Xj \Y = y} 



If all the input symbols are equally likely then this definition 
can also be expressed in terms of likelihood functions as 
follows. 



M 



N 



K M = — - 

8=1 



(Pr{F = y\X = xf}) 



N 



YlU F *{Y = y\x = x j } 



(14) 



Notice that this equation describes a sort of log-likelihood 
ratio. For instance, for N = 2 (fl4l) reduces to 



M n {y} = 



log 
log 



Pr{F= 


-y 


X 




Pr{Y = 
Pr{Y = 


-y 
-y 


X 
X 


= X2} 


Pr{Y = 


-y 


X 


=xi} 



The definition of the transformation M-r. {.} given in ([131 ) 
restricts the channels on which our representation is applicable. 
If Pr{X — Xi \Y — y} is zero for an Xi in A and a y in TZ then 
M-ji {y} is not in Mr. Therefore, our representation can not 
be applicable to those channels if an observation rules an input 
symbol out, for instance, the binary erasure channel. This is 
the sole restriction that our representation has. 

The range space of the operators Ma {■} an d Mn{..} are 
squeezed into a certain hyperplane of M. . Let a subset of 
be defined as 



[ 1 1 



1 ]p = 0} 



Clearly, V is a N — 1 dimensional subspace of M N . More 
specifically, V is a hyperplane. For any Xi in A 



1 



1 N 1 

1 •■• 1 \M A {xi\ = V — 



2N 
= 0. 



j'=i 



2N 2 



Therefore, Ma {x{\ is in V for any Xi in A. Similarly, for 
any y in TZ 



[ 1 1 



1 ]M n {y} = 

flog Pj{x = ^\y = y}f =0 , 



nf=i?*{X = x j \Y = y} 

Hence, Miz {y} is also in V for any y in TZ. Therefore, the 
operators ^^{.j and Mtz{.} actually map the detection 
problem into an N — 1 dimensional Euclidean space. 

After introducing how to map the input symbols and ob- 
servations to M N , we have to prove that these transformations 
satisfy the necessary requirements imposed in the previous 
section. 



Theorem 1. For any Xi in A and any y in TZ 

1 



Pr{X = Xi \Y = y) 



C 



exp 



(- lly- x » 



(15) 



where y = M-ji{y}, = Ma{x%}> an d the normalization 
constant C is 



N 

c = ex p i - y - x ■ 



Proof: Due to the definition of Ma {x^} we have 

2 



y-Xi 



1 



v e,- -i ^ 

J 2N 2N 2 



1 W 

/V2 Y 



3=1 



(y)i 



D 



(16) 



where (y), denotes the i th component of vector y and D is 
given by 



D 



+ 2AT2 X! e J 

3=1 

Similarly, C can be expressed as 



1 



1 



27V 3 4iV 2 



N 



C = cxp(-D)Y, 



(y)3 



3 = 1 



M 



(17) 



The right hand side of ( fl"5T l can be derived using ( [TBI and ( fTTI i 

as 

1 



c exp 



k . x ||^_ exp(( y yiV) 

7 E 7 =i ex p((y)3'/ iV ) 



Inserting the definition of .M-^ {y} in this equation yields 

Pi{X=Xj\Y=y} 



1 

c 



exp 



I y - x * 



(nf =1 Pr{x^ir= y }) 1/w 
Ef =1 Pr{x^|y= y } 

(nf =1 Pr{X=x 3 |r =!/ }) 1/JV 

Pv{X = Xj \Y = y} 



which completes the proof. ■ 
We have stated the desired form of the relation between 
Pi{X\Y} and \\M A {X} - M n {Y}\\ in fl). Moreover, we 
have explained that the function /(.) should be in a certain 
form given in ([8j. The Theorem [JJ shows that the operators 
Ma {■} and M-ji {.} satisfy these requirements. 

The next theorem shows that the operators .M^i{.} and 
M-r, {.} satisfy ( fTOb . 

Theorem 2. Let M replicas of an equally likely distributed 

symbol X transmitted through a memoryless channel and the 
vector Y = [Yi, Y%, . . . , Ym] is received. Then a posteriori 
probability of X given Y is 



1 





M 








(19) 




3=1 





Pr{X = a^Y = y} = - exp 



where yj = Mn {yj} Q x,; = Ma {xi}, and the normaliza- 
tion constant C is 



'Notice that y is a vector in 1Z M whereas yj is a vector in 



Proof: Since, the channel is memoryless 

M 

Pr{Y = y\X = Xi } = H Pr{Yj = Vj \X = x t } 

3=1 

Using this equation, Bayes' theorem, and the fact that X 
is identically distributed, the a posteriori probability of X 
becomes 



IljliPr{* = ^l*j=yj} 
T,tiU?=iPr{X = x k \Y i = y i } 



Pi{X = Xl \Y = y} = 



exp 



Efc=i ex P (- E^i llyj - x fel 

The summation in the numerator can be evaluated as 

M M 

Y \\yj ~ x »n 2 = Y (yJyj - yJ Xi ~ x ^yj + x ^ Xi 



(20) 





M 


2 


M 


2 

M 


(18) = 


Y y i ~ X i 






+^||y j || 2 +(M-i)||x i || 2 




3=1 




3 = 1 


3 = 1 



N 
















') 




fc=l 




3=1 




This 



The summation in the exponential function in the denominator 
can be evaluated similarly as 

2 

M M M 

Y^hi - x fcii 2 = ^yj-^k - Y,yj 

3=1 3=1 3=1 

M 

£|| yj || 2 + (A/-l)||x fc || 2 

3 = 1 

||xfc|| 2 appearing in this equation can be replaced with ||xi|| 2 
due to ( fT2b . Then, inserting these expressions to replace 
summations inside ( f20b completes the proof. ■ 

A. Maximum a posteriori decision rule 

The maximum a posteriori (MAP) decision rule can be 
expressed within this geometric framework as follows. 

xmap = argmaxPr{X = x 4 \Y =y} 

X 

= argmax-^ exp(- ||y - 3C*||) 

x O 

where x, = Ma {xi] and y = Mn {y}. Since, C does not 
depend on x we can safely get rid of it. 

xmap = argmaxexp(- ||y - Xi||) 

X 

= arg max - ||y - x*|| 

X 

= argmin||y - Xi|| (21) 



the distance between Ma{xz} and Mu{y}, which is an 
expected result. 



x 2 (-1/18,1/9,-1/18 



f Q\ a x,(1/9,-1/18,-1/18) 



< 3 (-1/18,-1/18,1/9) 



Fig. 1 . The plane V corresponding to the Euclidean geometric representation 
of the channel described in Section |IV-A| The input symbols are mapped to 
points shown in black whereas the observations are mapped to points shown 
in red. The dashed lines depict the decision boundaries according to the MAP 
rule. 



IV. Examples 

In this section we provide some examples of our geomet- 
ric representation for communication channels. In all of the 
examples the alphabet consists of three elements. In other 
words, A = {xi, x 2 , £3}, and N = 3. The examples differ 
in the observation space and/or the probabilistic transition 
mechanism. As explained in the previous section images of 
the alphabet and the observation space lie on a hyperplane V . 
Therefore, we shall only display V in the illustrations. Since, 
N is three V is a two dimensional plane and it is easy to 
display V . We have to skip the straight forward computations 
in the examples for the sake of saving space. 

A. A channel with discrete output 

The first example is a discrete output channel with input 
alphabet A and observation space 1Z = {a, b, c, d, e, /}. Let 
Pr{X|y} is determined by the following table. 



Pr{X\Y} 


a 


b 


c 


d 


e 


f 


Xl 


0.34 


0.33 


0.33 


0.335 


0.335 


0.33 


X2 


0.33 


0.34 


0.33 


0.335 


0.33 


0.335 


X3 


0.33 


0.33 


0.34 


0.33 


0.335 


0.335 



The corresponding plane V is illustrated in Figure Q] 

B. Additive Gaussian noise channel 

Suppose that x\ = 0, X2 = 1, and X3 = — 1. Let Y = X+N 
where N is a zero mean normally distributed random variable 
of variance a 2 . The observation space 1Z in this problem is the 
whole real line. If X is assumed to be equally likely distributed 
then Pr{X|y} is given as 



Pr{X 



\Y = y} 



exp(- 



. \y- 



'-) 



Ei=i ex p(- 



\y~ x j\~ 



A4k {y} can be computed from ([L31 after a few steps of 
derivations as 



M n {y} = ^[ 2 



6y - 1 -6y-l f 



(22) 



The corresponding plane V is shown in Figure [2] Notice that 
image of 1Z is a line in this case. The M-ji {y} goes upwards 
in the figure with increasing y. The line moves towards right 
with decreasing a 2 . 



Fig. 2. The plane V corresponding to the Euclidean geometric representation 
of the channel described in Section IIV-BI 




Fig. 3. The plane V corresponding to the Euclidean geometric representation 
of the channel described in Section IIV-CI 



C. Additive Laplacian noise channel 

Consider the very same problem in Section IIV-BI except 
N is distributed with a Laplacian distribution with parameter 
A. We have a four piece solution for M-r, {y}. We omit the 
derivation and show the resulting V for a specific A in Figure 
[3] Notice that all y > 1 are mapped to the same point, since, 
all y > 1 results with the same a posteriori probability of X. 

V. Summary and Possible Applications 

We have presented an Euclidean geometric representation 
for the classical detection problem by mapping the hypotheses 
and observations to M. N , We tried to make this representation 
useful for the scenarios in which a sequence of symbols is 
transmitted through the channel. Therefore, our representation 
may prove useful for applying coding theory to arbitrary 
channels. 

An important possible application might be code design 
for arbitrary channels. It is well known that increasing the 
minimum Hamming distance of a code does not always 
increase the performance of the code. We claim that the 
distance measure defined below is a good candidate to be 
optimized while designing codes for arbitrary channels. 



d v (c 1 ,c 2 ) 



E 



\M n M{Y}-M A M{c 2 }\\ 2 \X = Cl 



where Ci and c 2 are two codewords of length M and E[.\] 
denotes conditional expectation. Moreover, d v (. 1 .) can be 
computed simply in terms of a symbol-wise distance as 



d v (c ll c 2 ) = ^d s ((ci)i, (c 2 )i) 



where d s (xi, xj) is E \\Mn {Y} — Ma {%j}\\ \X = x t 

Another possible application might be estimating the per- 
formance of non-binary repetition codes which may be serious 



problem in some channels. Equation ( fT0T > may prove useful for 
this problem. 
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